Speech recognition with amplitude and frequency modulations.

نویسندگان

  • Fan-Gang Zeng
  • Kaibao Nie
  • Ginger S Stickney
  • Ying-Yee Kong
  • Michael Vongphoe
  • Ashish Bhargave
  • Chaogang Wei
  • Keli Cao
چکیده

Amplitude modulation (AM) and frequency modulation (FM) are commonly used in communication, but their relative contributions to speech recognition have not been fully explored. To bridge this gap, we derived slowly varying AM and FM from speech sounds and conducted listening tests using stimuli with different modulations in normal-hearing and cochlear-implant subjects. We found that although AM from a limited number of spectral bands may be sufficient for speech recognition in quiet, FM significantly enhances speech recognition in noise, as well as speaker and tone recognition. Additional speech reception threshold measures revealed that FM is particularly critical for speech recognition with a competing voice and is independent of spectral resolution and similarity. These results suggest that AM and FM provide independent yet complementary contributions to support robust speech recognition under realistic listening situations. Encoding FM may improve auditory scene analysis, cochlear-implant, and audiocoding performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Role of Temporal Amplitude Modulations in the Political Arena: Hillary Clinton vs. Donald Trump

Speech is an acoustic signal with inherent amplitude modulations in the 1-9 Hz range. Recent models of speech perception propose that this rhythmic nature of speech is central to speech recognition. Moreover, rhythmic amplitude modulations have been shown to have beneficial effects on language processing and the subjective impression listeners have of the speaker. This study investigated the ro...

متن کامل

Clear speech perception in acoustic and electric hearing.

When instructed to speak clearly for people with hearing loss, a talker can effectively enhance the intelligibility of his/her speech by producing "clear" speech. We analyzed global acoustic properties of clear and conversational speech from two talkers and measured their speech intelligibility over a wide range of signal-to-noise ratios in acoustic and electric hearing. Consistent with previou...

متن کامل

Speech detection and SNR prediction basing on amplitude modulation pattern recognition

A sound classification algorithm is presented which estimates the signal-to-noise ratio between speech and noise in 15 different frequency channels. The algorithm bases on the extraction of spectro-temporal features from the acoustical waveform. The approach is motivated by neurophysiological findings on periodicity coding in the auditory system of mammals. The extracted feature patterns are ca...

متن کامل

Association of Auditory Steady State Responses with Perception of Temporal Modulations and Speech in Noise

Amplitude modulations in the speech convey important acoustic information for speech perception. Auditory steady state response (ASSR) is thought to be physiological correlate of amplitude modulation perception. Limited research is available exploring association between ASSR and modulation detection ability as well as speech perception. Correlation of modulation detection thresholds (MDT) and ...

متن کامل

Auditory-Visual Speech Recognition with Amplitude and Frequency Modulations

A recent study by Zeng et al (2005) [PNAS, 102, 2293-2298] demonstrated the importance of FM cues for auditory speech identification in a competing noise environment. The current speech identification study investigated this finding for both an Auditory Only (AO) and an Auditory-Visual (AV) speech in noise identification task. The results demonstrated an FM advantage (compared to AM only) for b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings of the National Academy of Sciences of the United States of America

دوره 102 7  شماره 

صفحات  -

تاریخ انتشار 2005